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CONTRIBUTIONS FROM THE ZOOLOGICAL LABORATORY OF THE 

MUSEUM OF COMPARATIVE ZOOLOGY AT HARVARD 

COLLEGE, E. L. MARK, DIRECTOR, NO. LXXIX. 

A MEASURE OP VARIABILITY, AND THE RELATION 

OF INDIVIDUAL VARIATIONS TO SPECIFIC 

DIFFERENCES. 

By Edwin Tenney Brewster. 

Presented by E. L. Mark, April 14, 1897. 

Problem. 

This paper, prepared under the supervision of Dr. C. B. Davenport, 
deals with an inquiry into the relation between those small variations 
which distinguish individuals of the same group, and those larger differ- 
ences which separate species and genera. 

For the prosecution of such an inquiry, it is, first of all, necessary to 
devise a method of measuring variability. Quetelet ('46), Stieda ('82), 
Galton ('91), and Weldon ('93) have shown that variations in organisms 
follow the well known laws of the distribution of error. Thus the ordi- 
nary methods of treating problems in error of observation may be made 
to furnish a measure of variability ; it is upon these methods that the 
methods of this paper are based. 

Method. 

Any measurable quality of an object has a value, which is expressible 
by a number. A series of such numbers, expressing the varying value 
of a quality throughout a group of similar objects, will have a mean, 
about which the quantities are arranged in accordance with the law of 
distribution of error. Such a series is, for the present purpose, essen- 
tially like a series of slightly erroneous observations of a single quantity 
distributed about the true value. What, therefore, would be the proba- 
ble error in the latter case, is a measure of variability in the former. 

Suppose such a series of numbers obtained by measuring some single 
quality in each individual of a natural group of organisms. Let there be 
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n individuals, and let d x , d^ ... d n represent the difference between each 

number of the series and the mean of all. 

Then 

0.8453 ((f t + d 2 ...d n ) 

V» (» — 1) 

which is the common working formula for probable error,* is a measure 
of the variability of the given quality in this particular group. 
Or, in symbols, 

lr 0.8453 2 d 

Vcc — : 

\/n (» — 1) 

where V stands for variability, 2 d = d 1 + rf 2 . . . d n , and cc may be read 
" is measured by." This is, appproximately, Galton's Q. 
Obviously, 

V<* , • (Formula 1.) 

V» (n — 1) v y 

In Formula 1, 

V» («-!) = V^ 11 ^ = Y« 2 (l - i) = n i/l - i . 

V n 



lr %d 1 

Tex X 



^ /7 

Consider the expression, 

n 

Here 2 d is the sum of all differences between single numbers of the 

series and the mean of the series. Since these are n in number, — is 

n 

the average deviation of single values from the mean value. 
Let a d be the symbol for this average deviation. Then, 

Vx ad , * . (Formula 2.) 

V n 

Suppose, however, that the number of cases measured is large ; that 
is to say, that n is made indefinitely large. As n approaches infinity, 

- approaches 0, and consequently — = approaches 1. 

vA-- 

V n 



* See any text-book treating of such subjects ; for example, Merriman ('84), p. 93. 
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If, then, n be taken sufficiently large, 

Vex. ad. (Formula 3.) 

Even if n is not large, Formula 3 may usually be employed in practice. 
For in any single investigation n is likely to be constant, — when, as 
will often be the case, different qualities of the same individuals are to be 
compared it is necessarily so, — but the introduction of a constant factor 
will not affect the correctness of the formula. 

Practically, a d may be found accurately enough without the labor of 
subtracting each quantity from the mean. 

Let 

a + -, a + d, a + c, a + b, a — b, a — c, a — d a — - 

2 '2 

be a series of quantities, n in number, distributed according to the law of 
probability. The mean is evidently a, and the differences between the 
several quantities and the mean are 

-,..-. d,c,b,b, c,d, _ . 

jj. ... + d+c + b + b + c + d.... + % 

j " 2 
.•. ad= — — 



n 
25 + 2e+ 2d .... + n 



(Formula 4.) 



Or, again, 



a + -, .... a + d,a + c, a + b, a — b, a — c, a — d, a — - , 

& 2 

being the series of n measurements as before, the sum of all terms greater 
than the mean is 

(« + |J • ■ • • + (a + d) + (a + c) + (a + b), 



and of these the mean is 



na , n 

-j+ r -- - + d+c + b 

n 
2 



272 PROCEEDINGS OP THE AMERICAN ACADEMY. 



In like manner, the sum of all terms less than the mean is 

na n 

T _ 2' 



na n , , 
.... — a — c — o 



2 
One half the difference between these is 

26 + 2e + 2d.... n 



(Formula 5.) 



But Formula 5 is identical with Formula 4, and is therefore the formula 
for the average deviation. 

Hence the average deviation may be found by separating the numbers 
into two groups, — one of which shall contain all quantities greater than 
the mean, the other all those less than the mean, — and taking half the 
difference between the means of each group. 

In practice there will usually be several observed values equal to the 
mean ; these may be distributed to make the two groups of equal size. 
If the number of observed values is odd, one of these mean values may 
be neglected. 

In order to compare the average deviation of one group of numbers 
with that of another which has a different mean, it is necessary to reduce 
the two to a common measure. This is most simply done by dividing 
each average deviation by the corresponding mean.* It is proposed to 
call this ratio the coefficient of variability and to designate it by the 
symbol C. V. 

* The justification for this procedure is found in the following considerations: 
The relative size of the average deviation of two organs depends very largely upon 
the relative size of these organs. Where the mean dimension is large, we expect 
a greater average deviation than where it is small. Thus the average deviation 
of the stature of adult British males from the mean is about 2 inches. An aver- 
age deviation of 2 inches in the length of the nose, in any race, would clearly 
indicate a much greater variability in the nose length than in stature. In compar- 
ing the variability of two such diverse measures as stature and nose length, it 13 
better to compare the ratios of the average deviations to the mean dimension. 
Thus, since the mean stature of adult British males may be taken at 67 inches, 
variability in stature may be expressed by the ratio & = .02985. This number 
indicates that the average deviation from the mean stature is about three one- 
hundredths of the mean stature ; which is clearly more important than to say that 
it is 2 inches. Moreover, this method of expression has the advantage that it is 
independent of the unit in which the dimension is measured, whether feet, milli- 
meters, grams, degrees, or ergs. — C. B. Davenport. 
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In practice C. V. may be found most readily by separating the given 
numbers into two groups, as discussed above, and finding the mean of 
each. The difference between these two means, divided by their sum, is 
the O. V. of the structure under consideration. 

Application of the Method of Measuring Variability to the Problem of the 
Relation between Individual Variations and Specific Differences. 

Thesis. 
While it is generally agreed that specific characters are more subject to 
striking variations in individuals than are the characters common to allied 
species, it is not clear how far this relation extends. I wish to show that 
what is true of obvious variations and sports is also true of those minute 
differences between individuals which only careful measurements can 
detect ; or, in other words, that any measurable quality is, in general, 
variable in individuals in proportion as it is a distinguishing character 
of the group to which the individuals belong. 

Evidence. 

Table A is based on the extensive tables of body measurement of 
twenty races of men, which are given by Weisbach ('78) in the Appendix 
to his " Korpermessungen," etc. Each of the numbers of the first eight 
columns is the G. V. of a single dimension of a single race, and the ninth 
column gives the average for eight races of the 0. V.'s of each dimension. 
The values of these 0. V.'s only approximate to the true values, for the 
reason that the number of individuals measured is not sufficiently large 
to eliminate all accidents of age and sex. 

An examination of Table A shows that these eighteen dimensions are 
of nearly the same relative variability in each of the eight races. This 
fact is well brought out by Table B, in which the largest number of each 
column of Table A is replaced by 1, the next smaller by 2, and so on up 
to 18. From Table B it appears that certain dimensions — as, for ex- 
ample, the height of the forehead — are always decidedly variable; and, 
on the other hand, it appears that other dimensions — such as the length 
and breadth of the head — are more constant. 

The last column of Table A, "Mean of 20 Races," gives the coef- 
ficient of variability, not of any individuals, but of the means of each of 
the twenty races of Weisbach's tables. That is, the mean value of a 
dimension in each race is treated as are the dimensions of individuals in 
other columns. This column, therefore, shows the distribution of racial 
differences in the same way in which the remainder of the table shows 
vol. xxxii. — 18 
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Head length . . . 
Head breadth . . 
Nose length . . . 
Nose breadth . . . 
Nose height . . . 
Forehead height 
Under jaw length . 
Mouth breadth . . 
Upper face breadth 
Lower face breadth 
Upper arm length . 
Forearm length 
Middle finger length 
Hand width . . . 
Upper leg length . 
Lower leg length . 
Foot length . . . 
Foot breadth . . . 
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the distribution of individual variations. To show the connection between 
individual variations and racial differences, I take all possible pairs of 
coefficients in the first nine columns of Table A and compare them with 
the corresponding pairs in the last column. When, in one of the first 
uiue columns, the member of a pair which is larger is larger in the tenth 
column also, it counts one in the line at the bottom of Table A marked 
" Cases of Agreement." When, however, the larger member of a pair 
in one column is the smaller in the other, it counts one in the line marked 
" Cases of Disagreement." For example, in the column of Slavs, the 
C. V. of the head length is 0.0314, that of the head breadth is 0.0345, 
the head breadth being the more variable. In the column of racial dif- 
ferences also, the G. V. of the head breadth is greater than that of the 
head length ; therefore this gives in column of Slavs one " Case of 
Agreement." On the other hand, for the Slavs, the heights of the fore- 
head and of the nose do not agree in relative variability with the means 
of the twenty races given in the last column ; consequently this compari- 
son gives a " Case of Disagreement." 



TABLE B. 
Showing Order op Magnitude op C. V. of Table A.* 
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Forehead height .... 
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Under jaw length . . . 
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Upper arm length . . . 
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* I. e. in column "20 Races," head length has C. V. smallest; head breadth, next 
larger ; C. V. of nose height is largest. 
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It results, then, from the two lines of numbers at the bottom of Table A, 
that, in spite of the somewhat inaccurate values of this coefficient, in from 
two thirds to four fifths of the comparisons, that dimension in any pair 
which is the more variable among individuals of the same race is likewise 
the dimension which is the more diverse in the various races. Inciden- 
tally it may be noted, too, that the measurements of the face have, in 
general, the largest coefficients, and thus the high value for personal 
identification which has long been accorded to photographs of the face 
alone is justified. 

TABLE C. 

C. V. op Various Dimensions of Skulls of a Genus (Lepus) of Rodents. 
(Based on Coues and Allen, 77, pp. 222-226.) 



Total length 

Greatest width 

Distance between orbits 

Nasal bones, length 

Nasal bones, width behind 

Nasal bones, width before 

Upper incisors to molars 

Upper incisors to hinder margin of palate . 

Upper incisors, height 

Upper incisors, width . . 

Length of upper molars 

Width between upper molars 

Lower jaw length 

Lower jaw height 



Cases of Agreement 



Cases of Disagreement 



0.121 

0.0935 

0.0825 

0.122 

0.150 

0.178 

0.125 

0.0799 

0.146 

0146 

0.0958 

0.14 

0.124 

0.122 



0.0438 
00272 
0.0691 
0.0732 
0.0684 
0.0760 
0.0527 
0.0477 
0.0364 
0.0502 
0.0459 
0.0348 
0.0491 
0.0329 



55 



36 



0.0178 
0.0239 
0.0475 
0.0348 
0.0327 
0.0405 
0.0280 
0.0238 
0.0568 
0.0434 
0.0393 
0.0452 
0.0240 
0.0174 



59 



32 



0.0308 
0.0255 
0.0583 
0.0540 
0.0505 
0.0582 
0.0403 
0.0357 
0.0466 
0.0468 
0.0426 
0.0400 
0.0365 
0.0251 



60 



31 



Table C shows for a genus of rodents what Table A shows for races of 
mankind. The first column is computed from the means of twelve species 
and subspecies of the genus Lepus. This gives, therefore, the relative 
value of these dimensions as specific and subspecific differences. The 
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remaining columns give the relative variability of the several dimensions 
in individuals of the two species. The numbers at the bottom of the table 
— " Cases of Agreement " and " Cases of Disagreement " — are found as 
in Table A. Table C is much less satisfactory than Table A, because 
the differences in variability between different dimensions are here small, 
and therefore the relative sizes of the coefficients are more at the mercy 
of accident. Even here, however, there is a decided preponderance of 
cases in which individual variation is directly correlated with specific 
difference. 

TABLE D. 

C. V. of Four Measurements of Two Genera or Rodents. 
(Based on Miller, '93 and '93 a .) 





Tail. 


Hind Foot. 


Ear. 


Tail and 
Body. 




0.033 
0.054 


0.027 
0.034 


0.028 
0.070 


0.025 
0.043 




0.043 


0.030 


0.049 


0.034 




Tail. 


Hind Foot. 


Ear. 


Body. 


90 Sitomys americanus canadensis 


0.087 
0.070 


0.028 
0.031 


0.050 
0.052 


0.056 
0.049 


Mean 


0.068 


0.029 


0.051 


0.053 





Table D is computed from the measurements given with the descrip- 
tion of two new species of rodents.* Zapus insignis and Z. hudsonius 
are reported to resemble each other closely, but to differ in the length of 
the ear. Sitomys americanus and S. americanus canadensis are described 
as nearly alike except in the length of the tail. Table D shows that, of 
the four dimensions which are recorded for these species, the mean C. V. 
of the ear is greatest in Zapus, and of the tail greatest in Sitomys. The 
O. V. in each single species, too, is greatest for the characteristic dimen- 
sion, except in the case of Z. insignis. Possibly Z. insignis is the con- 



* Miller ('93), pp. 1-8 and 55-70. 
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stant parent species from which Z. hudsonius, with its remarkably high 
coefficient for the ear, has separated. At any rate, in each genus, the 
characters which mark the species have the highest mean coefficient. 

TABLE E. 

C. V. op Measurements of Three Carnivora. 

(Measured by the Author.) 



o a 

J 1 



a" 






% 



Length of nasal bones 

Length of frontals 

Length of snout, incisors to margin of palate . 

Total basal length 

Length from posterior nares to occipital foramen 
Length of parietal and occipital bones . . . 

Length' of tooth series 

Width between canines 

Width between maxillary bones 

Width of zygoma 



0.050 
0.038 
0.039 
0.028 
0.024 
0.030 
0.044 
0.041 
0.045 
0.049 



0.054 
0.056 
0.062 
0.054 
0.058 
0.046 
0.050 
0.055 
0.040 
0.054 



0.039 
0.031 
0.033 
0.029 
0.026 
0.027 
0.024 
0.045 
0.028 
0.044 



0.052 
0.047 
0.050 
0.041 
0.041 
0.038 
0.047 
0.048 
0.042 
0.051 



0.048 
0.041 
0.045 
0.037 
0.036 
0.034 
0.039 
0.047 
0.038 
0.049 



Table E gives the G. V.'s of ten dimensions of the skulls of the cat, 
fox, and lynx. A comparison of the skulls of these three forms shows 
that the representatives of the cat and dog families are very similar, ex- 
cept for the length of the muzzle and # the shape of the zygoma. A short 
muzzle and a wide zygoma are characteristic of all the cats,* and nowhere 
is the difference between the skulls more marked than in the length of 
the nasal bones. Examination of Table E shows that it is in just these 
dimensions — namely, those of the front part of the skull, the zygoma, 
and especially the nasal bones — that individuals are most variable ; while, 
on the other hand, the " total basal length " and the dimensions of the 
back part of the skull, which are alike in the two families, give the 
smallest coefficients of variability. 



* Flower (70), p. 142. 
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Conclusion. 

These four cases are the only ones to which I have applied this method 
of measuring variability. They are, however, taken entirely at random, 
and are in no wise selected cases. The conclusion to which they all 
point is that which, on general grounds, seems most likely to be true. 
This conclusion is, that there is so intimate a causal connection between 
the characters of individuals and those of the allied groups into which 
they are combined, that, in proportion as any character is variable in the 
individuals of one group, it is different in the allied groups. 

Finally, I have to express my great indebtedness for assistance, criti- 
cism, and suggestion to Dr. Charles B. Davenport and Mr. Frederick H. 
Safford, both of Harvard University. 

WOLFBOKOUGH, N. H., 

March 5, 1896. 
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